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BACKGROUN D OF TT-TF. INVENTION 



1 . Field of Invention 

The present invention relates generally to the performance of garbage 
collection in object-based systems. More particularly, the present invention relates to 
enabling different garbage collectors to be implemented for use with a virtual 
machine substantially altering the virtual machine to support each garbage collector. 



2 . Description of the Related Art 

Within an object-based environment, threads are often used to satisfy requests 
for services. A thread may be thought of as a "sketch pad" of storage resources, and 
is essentially a single sequential flow of control within a computer program. In 
general, a thread, or a "thread of control," is a sequence of central processing unit 
(CPU) instructions or programming language statements that may be independently 
executed. Each thread in a multi-threaded environment has its own execution stack 
on which method activations reside. As will be appreciated by those skilled in the art 
when a method is activated with respect to a thread, an activation is "pushed" on the 
execution stack of the thread. When the method returns, or is deactivated, the 
activation is "popped" from the execution stack. Since an activation of one method 
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manner 



activate another method, an execution stack operates in a first-in-last-out 
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While maintaining multiple threads, a computing system may need to perform 
global operations which require synchronization, or control of all threads or a set of 
threads at a given time. One example of such a global operation is garbage collection. 
As will be appreciated by those skilled in the art, garbage collection is a method 
which allows memory storage associated with objects which are no longer alive to be 
automatically reclaimed. 



Garbage collection generally occurs in two phases. A first phase typically 
involves identifying objects which are no longer needed or no longer in use, while a 
second phase renders the storage associated with the unneeded objects available for 
reallocation. An object in a program is needed, or alive, at a given time if a program 
1 5 may access the object in the future. When an object is such that it will not be 

accessed in the future, the object is considered to be unneeded or dead. Garbage 
collection algorithms, or "garbage collectors," typically consider an object to be dead 
substantially only when a program has abandoned all pointers to the object, making 
future access to the object impossible. 



Within systems which use garbage collectors, e.g., within systems which 
include virtual machines, when it is desired for a garbage collector to be modified or 
replaced with another garbage collector, the virtual machine itself must also be 
modified to support the modified or replacement garbage collector. At times, it may 
25 be desirable to replace an existing garbage collector in a system with a different 
garbage collector to improve the efficiency of garbage collection and, hence, the 
performance of the system. Typically, the process of altering or changing a garbage 
collector is time-consuming as it often requires substantial modifications be made to 
the virtual machine to accommodate the altered or new garbage collector. 



30 



Therefore, what is desired is a relatively inexpensive method and apparatus for 
enabling a garbage collector associated with a virtual machine to either be modified 
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or changed. Specifically, wha, is needed is a method and an apparatus for enabling 
garbage collectors to be efficiently swapped into and swapped out of virtual mactune 



systems. 
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fjiiM Mftgy nr THE INVEN TION 

The present invention relates to methods and apparatus for enabling modular 
garbage collectors to be swapped into and swapped out of virtual machines without 
5 requiring that the virtual machines be altered. According to one aspect of the present 
invention, an interface for reclaiming memory in a mul<i-.hreade4 virtual mactane 
environment that has a virtual machine includes a firs, module mat is associated wtth 
the virtua! machine and a second module that is separate from the firs, module but 
interfaces with the firs, modu!e. The first module includes a first routine for inttiating 
,0 a garbage collection process within the virtual machine environment, a second routme 
for scanning roots associated with the virtual machine environment, and a tad 
routine for following roots through objects associated with the virtual maclune 
environment. The second module includes a fourth routine for initializing a heap m 
the virtual machine environment, a fifth routine for allocating an object in the vtrtual 
machine environment, and a sixth routine for performing the garbage collection 
process. In one embodiment, the second module also includes a seventh routine for 
implementing read and write barriers for the garbage collection process. 

These and other advantages of the present invention will become apparent 
20 upon reading the following detailed descriptions and studying the various figures of 



15 



the drawings. 
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vt ^TPV DESCRIPTI ON OF THE DRAWINGS 

The invention may best be understood by reference to the following 
description taken in conjunction with the accompanying drawings in which: 

5 

Figures 1A and IB are a diagrammatic representation of an environment 
which supports pluggable garbage collectors in accordance with an embodiment of 
the present invention. 

1 0 Figure 2 is a process flow diagram that illustrates the steps associated with the 

overall operation of a virtual machine in accordance with an embodiment of the 
present invention. 

Figure 3 is a process flow diagram that illustrates the steps associated with 
15 performing a garbage collection, e.g., step 212 of Figure 2, in accordance with an 
embodiment of the present invention. 

Figure 4 is a diagrammatic representation of a memory space in accordance 
with an embodiment of the present invention. 

20 

Figure 5 is a diagrammatic representation of a general-purpose computer 
system suitable for implementing the present invention. 

Figure 6 is a diagrammatic representation of a virtual machine suitable for 
25 implementing the present invention. 
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yYRT att pf> npgrRTPTTON of the EMBODIMENTS 

In the following description, numerous specific details are set forth in order to 
provide a thorough understanding of the present invention. It will be apparent, 
5 however, to one skilled in the art that the present invention may be practiced without 
some or all of these specific details. In other instances, well known structures or 
operations have not been described in detail in order to not unnecessarily obscure the 
invention. 

1 o A garbage collection framework which enables different garbage collectors to 

be developed and readily "plugged into" the framework without requiring changes to 
be made to a virtual machine increases the flexibility of systems which use the 
garbage collectors. For example, an improved garbage collector may be readily 
implemented in an existing system to increase the efficiency with which the system 

15 operates, thereby increasing overall system performance. 

In one embodiment, a virtual machine environment maintains a memory 
system substantially separately from a virtual machine, and separates common 
garbage collection routines, e.g., scanning routines, from algorithm-specific routines. 
20 To enable garbage collectors to be plugged into a virtual machine environment 

without affecting a virtual machine, algorithm-specific garbage collection routines 
may be maintained separately from common or shared garbage collection routines. 

Figure 1A is a diagrammatic representation of an environment which enables 
25 garbage collectors to be swapped in and swapped out without requiring that a virtual 
machine be altered in accordance with an embodiment of the present invention. An 
environment 102, e.g., a virtual machine environment, includes a virtual machine 104 
and a memory system 106 that is substantially separated from virtual machine 104. 
One example of a virtual machine will be described below with reference to Figure 6. 
30 Allowing memory system 106 to be substantially separated from virtual machine 104 
facilitates the ability to use different garbage collectors interchangeably with respect 
to environment 102. 
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Memory system 106 includes shared garbage collector code 108 which 
enables substantially any specific garbage collector, as for example garbage collector 
1 12, to be implemented with respect to memory system 106. The use of shared 
garbage collector code 108 enables specific garbage collectors, such as specific 
garbage collectors 1 12 and 1 14, to be used substantially interchangeably with respect 
to virtual machine 104 without requiring modifications to be made to virtual machine 
104. As shown in Figure IB, specific garbage collector 1 14 may be used to perform 
garbage collection on memory system 106 in lieu of specific garbage collector 112. 



In order for garbage collectors 1 12 and 1 14 to be substantially pluggable , 
garbage collectors 1 12 and 1 14 are written to interface with shared garbage collector 
code 108. Hence, as long as garbage collectors 1 12 and 1 14 are consistent with 
shared garbage collector code 108, garbage collectors 1 12 and 1 14 may be 
1 5 implemented without modifying virtual machine 1 04. 

In addition to shared garbage collector code 108, code associated with 
memory system 106, i.e., memory system code, which may be used by a virtual 
machine generally includes code that initializes heaps and allocates object. Shared 

20 garbage collector code 1 08 which is intended to be used by substantially any suitable 
garbage collection implementation may include, but is not limited to, code which 
stops thread execution at garbage collection safe points, code that identifies and scans 
exact roots, code that finds and scans references in objects or arrays, and code that 
handles special objects and cases. The special objects and cases may include weak 

25 references, finalization, string interning, and class uploading. 

Although the virtual machine environment in which modular or pluggable 
garbage collectors are used may be widely varied, in the described embodiment, the 
virtual machine environment is a Java™ virtual machine environment, developed by 
30 Sun Microsystems, Inc. of Palo Alto, California. In a Java™ environment, shared 
garbage collection interfaces or routines which a modular or pluggable garbage 
collection implementation, e.g., a developer-written implementation, uses may 
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include, but are not limited to, a "CVMgcStopTheWorldAndGCO" routine which 
effectively initiates a garbage collection, a "CVMgcScanRootsO" routine which scans 
for exact garbage collection roots, a "CVMobjectWalkRefs(obj, callback)" routine 
which walks through objects, and "CVMgcScanSpecialO" and 
"CVMgcProcessSpecialWithLivenessInfoO" routines which are arranged to handle 
special objects. If a garbage collection implementation utilizes such routines, then the 
garbage collection implementation, e.g., a specific garbage collector, may be 
efficiently plugged into a virtual machine environment. 

A specific garbage collection implementation that is pluggable will generally 
include algorithm-specific garbage collection code. Each garbage collection 
implementation may have its own type of heap layout, its own heap initialization 
process, and its own heap teardown process. Algorithm-specific code may also have 
specific object allocation processes, specific object reclamation algorithms, and 
specific read and write barriers. By way of example, the read and write barriers 
associated with generational garbage collectors may be specified in algorithm-specific 
code, and are substantially unaffected by the associated virtual machine environment. 
In one embodiment, if no implementation of read and write barriers are provided by 
algorithm-specific code, the barriers are considered to be empty. 

In general, in order for algorithm-specific garbage collection code to be 
recognized by shared garbage collection code once the algorithm-specific garbage 
collection code is interfaced with the shared garbage collection code, routines 
associated with the algorithm-specific code have names which are specified by the 
share garbage collection code. Within a Java™ environment, for example, an 
algorithm-specific heap initialization routine may be expected to be implemented as a 
"CVMgccimpInitHeapO" process. In other words, the shared garbage collection code 
may be arranged to make a call to a "CVMgccimpInitHeapO" method in order to 
initialize a heap. Similarly, an algorithm-specific object allocation routine may be 
expected to be implemented as a "CVMgcimplAllocObject(size)" method, and an 
algorithm-specific garbage collection performed once threads are stopped may be 
expected to be implemented as a "CVMcimplGCO" method. In addition, an 
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algorithm-specific read barrier may be expected to be implemented as either a 
"CVMgcimplReadBarrierRefO" method or a "CVMgcimplReadBarrier<T>" method, 
while an algorithm-specific write barrier may be expected to be implemented as either 
a "CVMgcimpWriteBarrierRefO" method or a "CVMgcimplWriteBarrier<T>" 
5 method. 



Figure 2 is a process flow diagram which illustrates the steps associated with 
the overall operation of a virtual machine in accordance with an embodiment of the 
present invention. A virtual machine allocates memory in step 204. In general, a 

10 virtual machine may allocate memory to objects as needed using a memory allocator. 
When there is insufficient memory to allocate objects, then the memory allocator 
typically decides to perform a garbage collection in order to reclaim memory space in 
step 208. Reclaiming memory space using a garbage collection, as will be 
appreciated by those skilled in the art, typically involves reclaiming memory space 

1 5 allocated to obj ects which may no longer be accessed. As will be appreciated by 
those skilled in the art, when a memory allocator decides to perform a garbage 
collection, a safe-point typically must be reached within the overall environment 
before a global operation, such as a garbage collection, which requires 
synchronization is performed. 

20 

In step 212, a garbage collection is performed. In the described embodiment, 
performing a garbage collection includes allowing shared garbage collection code, 
e.g., code that is shared by different garbage collectors, to interact with specific 
garbage collection code, e.g., garbage collection code associated with a specific 
25 garbage collector. One method of performing garbage collection will be described 
below with respect to Figure 3. After the garbage collection is performed, then 
process flow returns to step 204 in which the virtual machine is allowed to continue to 
allocate memory. 

30 Referring next to Figure 3, one method of performing garbage collection using 

a pluggable or modular garbage collector which includes shared code, e.g., code 
associated with a virtual machine, and code that is specific to the garbage collector 
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will be described in accordance with an embodiment of the present invention. That is, 
step 212 of Figure 2 will be described. A garbage collection begins at step 304 in 
which garbage collector specific code calls shared code to stop all threads at garbage 
collection points for the threads, e.g., at thread-specific safe-points. 

5 

All threads are stopped at the appropriate garbage collection points by the 
shared code in step 308. At garbage collection points, each thread of a system polls 
for garbage collection using, for example, method call points and backward branches 
in the shared code. In one embodiment, a thread at a garbage collection point may 
10 save its garbage collector scannable state and suspend itself. 

After all threads are stopped, then the shared code calls the specific code in 
step 312 to free up a predetermined number of bytes. In step 320, the specific code 
scans roots by calling methods included in the shared code. The roots may be 
15 scanned in order to process special roots by walking through the memory on which 
garbage collection is being performed. Once the roots are scanned, the specific code 
performs a sweep through the memory by calling shared code in step 324. The sweep 
enables the predetermined number of bytes to be freed, and the garbage collection is 
competed. 

20 

The garbage collector used to perform the garbage collection described in 
Figure 3 is a mark and sweep garbage collector, as will be appreciated by those 
skilled in the art. In general, the pluggable garbage collector of the present invention 
may be substantially any suitable garbage collector, and is not limited to being a mark 
25 and sweep garbage collector. 

In one embodiment, a generational garbage collector may be a garbage 
collector which is suitable for swapping into and swapping out of the system. A 
generational garbage collector may be used with respect to a memory space which is 
30 divided into a "young" space and an "old" space. Figure 4 is a diagrammatic 

representation of a memory space in accordance with an embodiment of the present 
invention. Within a memory space 402 or a heap, a division may be made to separate 
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the memory space into a "young" space 406 and an "old" space 410. A root set 41 8 
includes roots 422, e.g., exact roots, which reference objects 414 in memory space 
402. 

5 Within memory space 402, young space 406 is used for new object allocation, 

while old space 410 is used for tenured, or long-standing, objects 414, e.g., objects 
414e-g. That is, young space 406 is associated with a relatively young generation of 
objects 414 while old space 410 is associated with a relatively old generation of 
objects 414. A generational garbage collector which performs garbage collection 

10 within memory space 402 typically copies older objects 414 from young space 406 
into old space 410, then removes those objects 414 that were copied from young 
space 406, i.e., reclaims memory from young space 406 that was associated with 
objects 414 which were copied. A generational garbage collector would also remove 
dead objects 414, e.g., objects 414f and 414g, from both young space 406 and old 

15 space 4 1 0, as appropriate. 

In one embodiment, different garbage collectors may be used to perform 
garbage collection on young space 406 and old space 410. Specifically, in addition to 
being able swap different generational garbage collectors into and out of a system 

20 without modifying virtual machine code, young space 406 and old space 41 0 may 
each have a different pluggable garbage collector. A garbage collection framework 
may enable generational garbage collection code to be shared by individual garbage 
collectors for young space 406 and old space 410. The shared garbage collection 
code may be arranged to coordinate the timing between the individual garbage 

25 collectors, or the specific code associated with each individual garbage collector, that 
may be independently maintained. For example, the shared code may determine a 
point in time during which at least one of the individual or independent garbage 
collectors should perform a particular action. 

30 A shared interface for substantially pluggable generational garbage collectors 

may effectively invoke garbage collectors for each space or generation, e.g., a young 
generation or an old generation, based on need. For example, if a young space or 
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generation is full, then the shared interface may invoke a garbage collection on the 
young space. As will be understood by those skilled in the art, most objects die 
young. Hence, with respect to Figure 4, young space 406 is typically small in relation 
to old space 410. Garbage collection is, therefore, performed on young space 406 
frequently than it is performed on old space 410. Accordingly, the amount of 
spent on garbage collection may be reduced by implementing garbage 
collections independently on young space 406 and old space 410 on an as-needed 



more 
time 



basis. 



10 Figure 5 illustrates a typical, general-purpose computer system suitable for 

implementing the present invention. The computer system 1030 includes at least one 
processor 1032 (also referred to as a central processing unit, or CPU) that is coupled 
to memory devices including primary storage devices 1036 (typically a read only 
memory, or ROM) and primary storage devices 1034 (typically a random access 

1 5 memory, or RAM) . 

Computer system 1030 or, more specifically, CPUs 1032, may be arranged to 
support a virtual machine, as will be appreciated by those skilled in the art. One 
example of a virtual machine that may be supported on computer system 1030 will be 
20 describedbelowwithreferencetoFigure6. As is well known in the art, ROM acts to 
transfer data and instructions uni-directionally to the CPUs 1032, while RAM is used 
typically to transfer data and instructions in a bi-directional manner. CPUs 1032 may 
generally include any number of processors. Both primary storage devices 1034, 
1036 may include any suitable computer-readable media. A secondary storage 
25 medium 1038, which is typically a mass memory device, is also coupled bi- 
directionally to CPUs 1032 and provides additional data storage capacity. The mass 
memory device 1038 is a computer-readable medium that may be used to store 
programs including computer code, data, and the like. Typically, mass memory 
device 1038 is a storage medium such as a hard disk or a tape which generally slower 
30 than primary storage devices 1034, 1036. Mass memory storage device 1038 may 

take the form of a magnetic or paper tape reader or some other well-known device. It 
will be appreciated that the information retained within the mass memory device 
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1038 may, in appropriate cases, be incorporated in standard fashion as part of RAM 
1036 as virtual memory. A specific primary storage device 1034 such as a CD-ROM 
may also pass data uni-directionally to the CPUs 1032. 

CPUs 1032 are also coupled to one or more input/output devices 1040 that 
m ay include, but are not limited to, devices such as video monitors, track balls, mice, 
keyboards, microphones, touch-sensitive displays, transducer card readers, magnet* 
or paper tape readers, tablets, styluses, voice or handwriting recognizers, or other 
well-known input devices such as, of course, other computers. Finally, CPUs 1032 
optionally may be coupled to a computer or telecommunications network, e.g., an 
internet network or an intranet network, using a network connection as shown 
generally at 1012. With such a network connection, it is contemplated that the CPUs 
1032 might receive information from the network, or might output information to the 
network in the course of performing the above-described method steps. Such 
information, which is often represented as a sequence of instructions to be executed 
using CPUs 1032, may be received from and outputted to the network, for example, 
in the form of a computer data signal embodied in a carrier wave. The above- 
described devices and materials will be familiar to those of skill in the computer 
hardware and software arts. 

As previously mentioned, a virtual machine may execute on computer system 
1030 Figure 6 is a diagrammatic representation of a virtual machine which is 
supported by computer system 1030 of Figure 5, and is suitable for implementmg the 
present invention. When a computer program, e.g. , a computer program wntten m the 
Java- programming language, is executed, source code 1 1 10 is provided to a 
compiler 1120 within compile-time environment 1105. Compiler 1120 translates 
source code 1110 into bytecodes 1130. In general, source code 1110 is translated mto 
bytecodes 1130 at the time source code 1110 is created by a software developer. 



30 



Bytecodes 1130 may generally be reproduced, downloaded, or otherwise 
distributed through a network, e.g., network 1012 of Figure 5, or stored on a storage 
device such as primary storage 1034 of Figure 5. In the described embodunent, 
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decodes • .30 are platform independent. The, is, bytecodes , . 30 -nay be exec*ed 
oLbstantial.y any confute, system .ha, ,s running on a suitab.e vrrtual — 



1140. 



Bytecodes .130 arc provided to a runtime environment 1135 which includes 
virtual machine 1 140. Runtime environment 1 1 35 may generaUy be executing a 
processororprocessorssuchasCPUsl032ofF.gure5.Virh.al — 1140 

includes acompiler 1142, an interpreter 1144, andarun.,me system 1146. Bytecodes 
1 130 may be provided either to compiler 1 142 or interpreter 1 144. 

Whenbytecodes 1.30 are provided to compUer 1142, methods contained in 
decodes 1130 are compiled into machine instructions. In one embodm.en, 
clpiler 1 142 is aiust,n-t,me compi.er which de,ays the compile o, methods 
contained in bytecodes 1130 until the methods are about to be executed. When 

. , * , • + arnrpie . r 1 1 44 bytecodes 1 1 30 are read into 
15 bytecodesll30areprovidedtomterpreterll44 5 Dyie C 

i:crp re ,erll44onebytecodeata,ime. ^U^™^ 

interpreterll44"interprets"bytecodesll30,asw.liDe pp 

in the art. !n genera., interpreter . .44 processes bytecodes 1 130 and performs 
20 operationsassociatedwitttbytecodesllSOsubstantiaUyconnnuously. 

When a method is invoked by another method, or is invoked from runtime 

25 Ichmaybedirectlyexecutedbyintetpreter.^. .f, on the other hand, the me*od 

„46 a,so obtains the method from runtime environment 1 135 in the form of a 
sequence of bytecodes ,.30, then may go on, oac«va,e compiler 1.42. Cooler 
„42urcn B enera.esmachineinstructions fr omby,ecodesl.30,and«heresu,ung 

30 maia ^<^^^^^Z 

tne machine-language insttucnons are discarded when virtua, macmne .HO 

terminates. The operation of virtual machines or, more particularly, lava™ vrtua. 
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^^a^e^sor.ep^n,— ^ a^b» 

i„ve„.ion. By way of examp>e, ahhoug e ^ 

„een described as being suitabie for use m a Ja ^ 

+^ fnr use in substantially any suiwu 

Suitable environments include, but are 
a Smalltalk virtual machine. 

ofa.gori^-specmcgarbageconecuoncode. Thena, ^ 
of routines associa,ed wi.h *. hr.er.ace, ^""^ t sapD „r,ing 



interface. 
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• ♦ ^ that the steps associated with the various 
mgeneraUtshouldbeappreaatedthatthestep 

, , fthP nresent invention may be widely varic 
processes and methods of the present m , steps may also be 

me steps associated with the methods may be ahWo rf ^ 

TW,»fnre the present examples are vo u 
present invent,on. Therefore, th p „ the details given herem, 

m d no. restrictive, and the invenhon » no. to be Urn 
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